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ABSTRACT 


25  ZIKV  is  an  arbovirus  and  member  of  the  family  Flaviviridae  it  is  transmitted  throughout  Africa 

26  and  Asia  and  is  currently  causing  an  outbreak  in  South  America.  Here  we  present  the  complete 

27  genome  sequences  of  two  Zika  Virus  (ZIKV)  strains,  Zika  virus/H.sapiens- 

28  tc/THA/2014/SV0127-14  and  Zika  virus/H.sapiens-tc/PHL/2012/CPC-0740,  isolated  from  the 

29  blood  of  patients  collected  in  Thailand,  2014  and  the  Philippines,  2012.  Sequencing  and 

30  phylogenetic  analysis  showed  that  both  strains  belong  to  the  Asian  lineage. 

31 

32  Key  words:  Zika,  Flaviviridae,  Flavivirus,  whole  genome  sequence 

33 


34  Zika  Virus  (ZIKV)  has  garnered  worldwide  attention  as  researches  have  linked  the  virus  to  an 

35  increase  in  microcephaly  cases  during  the  current  outbreak  in  South  America  (1).  Once  thought 

36  to  cause  mild  infections,  ZIKV  is  now  the  subject  of  intensive  worldwide  research  collaborations 

37  and  efforts.  ZIKV  is  a  single-stranded  positive-sense  RNA  arbovirus  and  a  member  of  the 

38  Flaviviridae  family,  which  includes  dengue,  yellow  fever,  St.  Louis  encephalitis,  Japanese 

39  encephalitis  and  West  Nile  viruses  (2).  The  ZIKV  genome,  approximately  1 1  kilobases  in 

40  length,  is  similar  in  its  arrangement  to  other  members  of  Flaviviridae  containing  5’  and  3’ 

41  untranslated  regions  flanking  a  single  open  reading  frame  (ORF).  The  5’  and  3’  untranslated 

42  regions  are  thought  to  be  important  for  host  interaction,  viral  replication,  and  pathogenicity.  The 

43  ORF  codes  for  three  structural  proteins:  the  capsid  (C),  premembrane/membrane  (prM)  and 

44  envelope  (E)  and  seven  nonstructural  proteins  (NS  1,  NS2A,  NS2B,  NS3,  NS4A,  2K,  NS4B  and 

45  NS5),  which  are  responsible  for  viral  replication  and  assembly  (3).  Previous  phylogenetic 

46  analysis  based  on  the  nucleotide  sequences  of  ZIKV  indicated  two  major  lineages:  African  and 

47  Asian  (2). 

48  These  two  ZIKV  isolates  were  obtained  by  intrathoracic  inoculation  of  ZIKV  real-time  RT-PCR 

49  positive  patient’s  serum  samples  into  Toxorhynchites  splendens  mosquitoes  followed  by 

50  inoculation  of  mosquito  derived  C6/36  cells  (4,  5).  Viral  RNA  was  extracted  using  the  Direct- 

51  zol  RNA  extraction  kit  (Zymo  Research),  converted  to  cDNA  using  Superscript  III  (Invitrogen) 

52  and  amplified  using  sequence-independent  single-primer  amplification  (6)  combined  with 

53  primers  for  rapid  amplification  of  cDNA  ends  (7).  Sequencing  libraries  were  constructed  using 

54  PrepX  ILM  32  DNA  Library  Kit  (Wafergen)  and  sequenced  by  using  Illumina  NextSeq  platform 

55  (2x151  bp).  Adaptors  and  primers  were  clipped  from  the  sequence  reads  using  Cutadapt  version 

56  1 .2 1  (8)  and  low-quality  reads/bases  were  filtered  using  Prinseq-lite  version  0.20.4  (1)  ZIKV 


57  consensus  genomes  were  assembled  using  Ray  Meta  (9),  Bowtie2  v.  2.0.6  (10)  and  Samtools  v. 

58  0.1.18  (11). 

59  The  complete  genome  sequence  with  a  total  length  of  10,807  nucleotides  (nt)  containing  the  5’ 

60  (107  nt)  and  3’  (428  nt)  UTRs  and  one  ORF  (10,272  nt)  were  obtained  from  both  isolates  (12). 

61  Our  sequences  and  other  ZIKV  sequences  from  GenBank  were  used  in  maximum-likelihood 

62  phylogenetic  analysis  by  using  PhyML  3.1  (13)  with  GTR+G  model  (-InL  33630.580);  the  tree 

63  revealed  that  these  two  isolates  belong  to  the  Asian  lineage  and  are  closely  related  to  2015 

64  Brazilian  isolates.  This  confirms  our  previous  analyses  based  of  NS5  genes  of  these  two  isolates 

65  (4,  5).  Data  suggests  that  Asian  lineage  isolates  are  95-98%  identical  on  the  nt  level  and  98-99% 

66  identical  on  the  amino  acid  level.  Nevertheless,  some  substitutions  were  found  within 

67  primer/probe  binding  sites  in  the  genomes  of  these  two  isolates  sequences.  The  two  mismatches 

68  found  on  Zika  virus/H.sapiens-tc/PHL/2012/CPC-0740  genome  for  primer  835  (residue  1/23), 

69  and  for  probe  1107  (residue  1/3 1)  binding  sites  (14).  There  are  two  mismatches  found  on  Zika 

70  virus/H.sapiens-tc/THA/2014/SV0127-14  genome  for  primer  911c  (residue  21/22),  and  for  probe 

71  1 107  (residue  19/3 1)  binding  site  (14).  Further  ZIKV  genome  studies  and  comparisons  will  not 

72  only  elucidate  factors  involved  in  the  virulence  and  pathogenicity  of  ZIKV,  but  will  also  lend 

73  insight  into  the  evolution  of  this  virus  and  help  with  vaccine  design. 

74  Nucleotide  sequences  accession  numbers.  The  assembled  complete  genome  sequences  of  the 

75  Zika  virus/H.sapiens-tc/THA/2014/SV0127-14  and  Zika  virus/H.sapiens-tc/PHL/2012/CPC- 

76  0740  isolates  were  submitted  to  GenBank  under  the  accession  numbers  KU68 108 1  and 

77  KU681082,  respectively.  The  versions  described  in  this  paper  are  the  third  versions. 

78 
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